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|Attomey's Docket Number 

TRANSMITTAL LETTER TO THE UNITED STATES 1 056291-5073 

DESIGNATED/ELECTED OFFICE (DO/EO/US) |U.S. Application No. m y , , ~ 

CONCERNING A FILING UNDER 35 U.S.C. § 371 I Unassiqned 1 0* Q Q 9 0 " ^ 

International Application. No. | International Filing Date jPriority Date Claimed 

I I 



gPCT/GBOO/03100 



| August 15. 2000 



| August 21. 1999 



e of Invention: PROTEIN 



Applicants For EO/EO/US: John David Charles ROSAMUND and Norbert Friedemann SCHNELL 



Applicants herewith submit to the United States Designated/Elected Office (DO/EO/US) the following items and 
other information: 

1 . £3 This is a FIRST submission of items concerning a filing under 35 U.S.C. § 37 1 . 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 

35 U.S.C. § 371. 

3. tZl This express request to begin national examination procedures (35 U.S.C. § 371(f)) at 

any time rather than delay examination until the expiration of the applicable time limit 
_ set in 35 U.S.C. § 371(b) and PCT Articles 22 and 39(1). 

4. A proper Demand for International Preliminary Examination was made by the 19th 
month from the earliest claimed priority date. 

5. . [X] A copy of the International Application as filed (35 U.S.C. § 371(c)(2)) 

a. □ is transmitted herewith (required only if not transmitted by the 
_ International Bureau). 

b. ^ has been transmitted by the International Bureau. 

c. is not required, as the application was filed in the United States 
Receiving Office (RO/US). 

6. n A translation of the International Application into English (35 U.S.C. § 371(c)(2)). 

7. ^ Amendments to the claims of the International Application under PCT Article 19 

(35 U.S.C. § 371(c)(3)). 

a. are transmitted herewith (required only if not transmitted by the 
International Bureau). 

b. Q have been transmitted by the International Bureau. 

c. S have not been made; however, the time limit for making such 

amendments has NOT expired. 

d. EH have not been made and will not be made. 

8. O A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 

§ 371(c)(3)). 

9. [X] An oath or declaration of the inventors (35 U.S.C. § 371(c)(4)). 

10. [H A translation of the annexes to the International Preliminary Examination Report 

under PCT Article 36 (35 U.S.C. § 371(c)(5)). 
Items 11. to 14. below concern other document(s) or information included: 

11. □ An Information Disclosure Statement under 37 C.F.R. § 1 .97 and § 1 .98. 

12. £x] An assignment document for recording. A separate cover sheet in compliance with 
_ 37 C.F.R. § 3.28 and § 3.3 1 is included. 

13. □ A FIRST preliminary amendment. 

□ A SECOND or SUBSEQUENT preliminary amendment. 

14. [X] Other items or information: 

a. WO 01/14533 

b. PCT/IB/308 

c. International Search Report w/Search Report 

d. Paper Copy of Sequence Listing 

e. PCT/IB/409 
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U.S. APPLICATION NO. INTERNATIONAL APPLICATION NO. | ATTORNEY DOCKET NUMBER 
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15. [X] The following fees are submitted: 

Basic National Fee (37 C.F.R. § 1.492(a)(l)-(5)): 

Search Report has been prepared by the EPO or JPO $890.00 

International preliminary examination fee paid to 
USPTO (37 C.F.R. § 1.482) $710.00 

No international preliminary examination fee paid to 
USPTO (37 C.F.R. § 1.482) but international search fee 
paid to USPTO (37 C.F.R. § 1.445(a)(2)) $740.00 

Neither international preliminary examination fee 
(37 C.F.R. § 1.482) nor international search fee 
(37 C.F.R. § 1.445(a)(2)) paid to USPTO $1,040.00 

International preliminary examination fee paid to USPTO 
(37 C.F.R. § 1.482) and all claims satisfied provisions 

of PCT Article 33(2)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 


$890.00 


Surcharge of $130.00 for furnishing the oath or declaration later than 
□ 20 EE] 30 months from the earliest claimed priority date 
(37 C.F.R. S 1.492(e)). 


$ 


Claims 


Number Filed 


Number Extra 


Rate 




Total Claims 


15-20 = 


0 


X $18.00 


$ 


Independent Claims 


4 - 3 = 


1 


X $84.00 


$ 84.00 


Multiple dependent claim(s) (if applicable) 




+ $280.00 


$ 




TOTAL OF ABOVE CALCULATIONS 


$ 


Reduction by 1 / 2 for filing by small entity, if applicable. 
Verified Small Entity statement must also be filed. (Note 37 C.F.R. §§ 1 .9, 1 .27, 1 .28) 


-$ 


SUBTOTAL = 


$ 


Processing fee of $130.00 for furnishing the English translation later 

than □ 20 □ 30 months from the earliest claimed priority date (37 C.F.R. § 1 .492(f)). 


+$ 


TOTAL NATIONAL FEE = 


$974.00 


Fee for recording the enclosed assignment (37 C.F.R. § 1.21(h)). 
The Assignment must be accompanied by an appropriate cover sheet 
(37 C.F.R. 3.230, 3.31 ). $40.00 per property 


$ 40.00 


TOTAL FEES ENCLOSED = 


$ 1014.00 


Amount to be refunded 


$ 


Amount to be charged 


$ 1014.00 



a. 
b. 
c. 



□ 



A check in the amount of $_ 



to cover the above fees is enclosed. 



Please charge my Deposit Account No. 50-03 10 in the amount of $1014.00 
Except for issue fees payable under 37 C.F.R. § 1 . 1^7^>pommissioner is hereby 



authorized by this paper to charge any additional 
application including fees due under 37 C.F.R. § 
credit any overpayment to Deposit Account No 



pendency of this 
f ich may be required, or 



Customer No. 09629 
SEND ALL CORRESPONDENCE TO: 
Morgan, Lewis & Bockius LLP 
1111 Pennsylvania Avenue, N.W. 
Washington, D.C. 20004 
Telephone: (202) 739-3000 
Facsimile: (202) 739-3001 




Donal 

Reg."No. 25,323 



Submitted: February 21. 2002 
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10/06906? 

PATENT 

ATTORNEY DOCKET NO. 56291-5073 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Application of: 

John David Charles ROSAMUND et aL 

Application No.: 10/069,062 

Filed: February 21, 2002 

For: PROTEIN 

BOX SEQUENCE 

Commissioner for Patents 
Washington, DC. 20231 

STATEMENT ACCOMPANYING SEQUENCE LISTING 

The undersigned hereby states upon information and belief that the Sequence Listing 
submitted concurrently herewith does not include matter which goes beyond the content of the 
application as filed and that the information recorded on the diskette submitted concurrently 
herewith is identical to the written Sequence Listing submitted herewith. This submission is 
responsive to a Notification of Missing Requirements dated May 20, 2002. 



Group Art Unit: Not Assigned 
Examiner: Not Assigned 



Dated: June 14, 2002 



Respectfully submitted, 

MORGAN, LEWIS & BOCKIUS LLP 

By: ^^OaMS 3 - I^Wf^ 
Rachel B. Kapust 



Customer No. 09629 

MORGAN, LEWIS & BOCKIUS LLP 

1111 Pennsylvania Ave., NW 

Washington, DC 20004 

Tel: 202-739-3000; Fax: 202-739-3001 
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SEQUENCE LISTING 



<110> ROSAMUND, John David Charles 
SCHNELL, Norbert Friedemann 
ASTRAZENECA AB 

<120> PROTEIN 

<130> 56291-5073 

<140> US 10/069,062 
<141> 2002-02-21 

<150> PCT/GBOO/03100 
<151> 2000-08-15 

<150> GB 9919766.7 
<151> 1999-08-21 

<160> 11 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 547 
<212> DNA 

<213> Candida albicans 
<400> 1 

ccaatggaaa aaggaaaagc cagaagaaag ctctgttgtg tatgaccagc ttaatagtgc 60 
caatttacag tttatgaagg aattgaggga aatgcgtgaa aaatacgact cagacccaga 120 
gacttatatt aaagagttag atcattctgt tgagcctttg actgttgcga ttaagaacat 180 
cagaaaaggg ttacaagcat taacacaaaa atcagaggtt ccaattgaac ctgatgtcca 240 
aacccagttg ttggaccgtt gtcaagagat tcctggttgt gttggcggtg tggttccagg 300 
tgctggtgga tacgatgcaa tagctgtatt agtgttggaa aatcaagtgg gaaattttaa 360 
gcagaaaact cttgaaaatc cagattattt tcataatgtt tactgggttg atttggaaga 420 
gcaaacagaa ggtgtacttg aagaaaaacc agaagactat ataggtttat aaaatatcac 480 
taggatatgt ctacaaggtg atttcgatta gattttctgc tacccgtttt aacatatttt 540 
acttcaa 547 



<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single-stranded oligonucleotide 
<400> 2 

gctggtggat acgatgcaat a 21 



<210> 3 
<211> 577 
<212> DNA 

<213> Candida albicans 



1 



1, 0-O IB gOfeS , 3 022102 



<220> 

<221> misc_f eature 
<222> (1) . . (577) 
<223> n can be a or 



g or c or t 



<400> 3 

atgtacatct 

ggaacttctc 

atcttctata 

ttttttgggc 

tgataacatt 

ccgatgaacc 

ttttttctcc 

ccgagtaaat 

gatggatata 

ngtaattgat 



ttcatgtttg 
aggatcactt 
tacaatcaga 
ataacaatgt 
ggggataaaa 
taatccggtc 
attcgaggat 
aatgatttca 
gttgcctcta 
gatatgtgat 



aattcccagt 
tctagaacct 
ccataaattg 
gcaatctgtg 
tgggataata 
ttttccactt 
gtcttggttt 
agatcaaatg 
aaaatgggat 
antcccattc 



tacttgcaat 
gaaacacgtc 
cagttgcaac 
caacgttgtg 
aacttgtggc 
cggtaatggc 
cagtatcttc 
cttcggtcgg 
ttatgcgtga 
cccggtt 



caattttttc 
atttatcaaa 
atcaaaccca 
caaaatatct 
aacaactgac 
acgagaatgg 
ttgtgaatga 
ttgaatataa 
ctgnacttct 



aactctgtgg 
gctggctgaa 
gatcctatct 
ttattcgtac 
actaatcctg 
taaagaaaag 
tatccagggt 
gctaaaaacc 
ttgggttttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

577 



<210> 4 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single-stranded oligonucleotide 



<400> 4 

ggggataaaa tgggataata aactt 



25 



<210> 5 
<211> 1763 
<212> DNA 

<213> Candida albicans 



<400> 5 

gtggaaaaaa 

tttctttttc 

atttcttgct 

gtcat cacga 

caaaatttct 

agagaagccc 

cgttttagct 

ccctggatat 

atttctttac 

aggattagtg 

tacgaataaa 

gataggatct 

tcagccagct 

cacagagttg 

accatacgga 

ggtatcacga 

ccagcttaat 

cgactcagac 

tgcgattaag 



aagacagaac 
tccgatcatc 
ggtggatatt 
atgcatgcag 
tcaccccaat 
agagaagttc 
tatattcaac 
cattcacaag 
cattctcgtg 
tcagttgttg 
gatattttgc 
gggtttgatg 
ttgataaatg 
aaaaaattga 
atcaagttat 
gtactccaat 
agtgccaatt 
ccagagactt 
aacatcagaa 



agtagattcc 
aattggcaat 
tggttcttga 
ttataacacc 
ttgcaaacgg 
agtcacgcat 
cgaccgaagc 
aagatactga 
ccattaccga 
ccacaagttt 
acaacgttgc 
ttgcaactgc 
acgtgtttca 
ttgaaagtaa 
taatgggtga 
ggaaaaagga 
tacagtttat 
atattaaaga 
aagggttaca 



aacttcagaa 
gtcaaaagca 
gccaatttat 
aaaaggaacc 
agaatgggaa 
aaatccattt 
atttgat ctt 
aaccaagaca 
agtggaaaag 
attatcccat 
acagattgca 
aatttatggt 
ggttctagaa 
ctgggaattc 
cgtcaagggt 
aaagccagaa 
gaaggaattg 
gttagatcat 
agcattaaca 



tattcattca 
tttagtgcac 
gatgcttatg 
agtttgaaag 
tatcacatat 
ttagaggcaa 
gaaatcatca 
tcctcgaatg 
accggattag 
tttatcccca 
cattgttatg 
ctgattgtat 
agtgatcctg 
aaacatgaaa 
ggctcagaaa 
gaaagctctg 
agggaaatgc 
tctgttgagc 
caaaaatcag 



gatctgaaca 
ctggaaaagc 
tgacagcatt 
aatctagaat 
catcaaatac 
ctatattcat 
tttactcaga 
gagaaaaaac 
gttcatcggc 
atgttatcag 
cccaaaaaaa 
atagaagatt 
agaagttccc 
gatgtacatt 
cacccaaatt 
ttgtgtatga 
gtgaaaaata 
ctttgactgt 
aggttccaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



2 



tgaacctgat 
tggtgtggtt 
agtgggaaat 
ggttgatttg 
tttataaaat 
att ttaacat 
aagagcctga 
cat atgcagg 
accccagaga 
tattattaat 
tat actttcc 



gtccaaaccc 
ccaggtgctg 
tttaagcaga 
gaagagcaaa 
atcactggga 
attttacttc 
tcttcataaa 
aaatagctta 
cgaaataact 
ggtaacaaaa 
atattttttt 



agttgttgga 
gtggatacga 
aaactcttga 
cagaaggtgt 
tatgtctaca 
aatcttacac 
ccgttgcaaa 
ttcgacaaat 
gacaacacgc 
aaaggggrka 
ttt 



ccgttgtcaa 
tgcaatagct 
aaatccagat 
acttgaagaa 
aggtgtttt c 
tttatccttt 
ctaattgatt 
tatttatttt 
agggctgggt 
tgcccgtggt 



gagattcctg 
gtattagtgt 
tattttcata 
aaaccagaag 
gattagagtt 
taagtaggta 
atattttcta 
cgtctcgttc 
tggcattttc 
cgatacacaa 



gttgtgttgg 
tggaaaatca 
atgtttactg 
actatatagg 
tttgatcccc 
tgtgtaggga 
ttgtaaattt 
tggtccaagt 
gtcacacgat 
atatttatga 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1763 



<210> 6 
<211> 1299 
<212> DNA 

<213> Candida albicans 



<400> 6 

atgtcaaaag 

gagccaattt 

ccaaaaggaa 

ggagaatggg 

ataaatccat 

gcatttgatc 

gaaaccaaga 

gaagtggaaa 

ttattatccc 

gcacagattg 

gcaatttatg 

caggttctag 

aactgggaat 

gacgtcaagg 

gaaaagccag 

atgaaggaat 

gagttagatc 

caagcattaa 

gaccgttgtc * 

gatgcaatag 

gaaaatccag 

gtacttgaag 



catttagtgc 
atgatgctta 
ccagtttgaa 
aatatcacat 
ttttagaggc 
ttgaaatcat 
catcctcgaa 
agaccggatt 
attttatccc 
cacattgtta 
gtctgattgt 
aaagtgatcc 
tcaaacatga 
gtggctcaga 
aagaaagctc 
tgagggaaat 
attctgttga 
cacaaaaatc 
aagagattcc 
ctgtattagt 
attattttca 
aaaaaccaga 



acctggaaaa 
tgtgacagca 
agaatctaga 
atcatcaaat 
aactatattc 
catttactca 
tggagaaaaa 
aggttcatcg 
caatgttatc 
tgcccaaaaa 
atatagaaga 
tgagaagttc 
aagatgtaca 
aacacccaaa 
tgttgtgtat 
gcgtgaaaaa 
gcctttgact 
agaggttcca 
tggttgtgtt 
gttggaaaat 
taatgtttac 
agactatata 



gcatttcttg 
ttgtcatcac 
atcaaaattt 
acagagaagc 
atcgttttag 
gaccctggat 
acatttcttt 
gcaggattag 
agtacgaata 
aagataggat 
tttcagccag 
cccacagagt 
ttaccatacg 
ttggtatcac 
gaccagctta 
tacgactcag 
gttgcgatta 
attgaacctg 
ggtggtgtgg 
caagtgggaa 
tgggttgatt 
ggtttataa 



ctggtggata 
gaatgcatgc 
cttcacccca 
ccagagaagt 
cttatattca 
atcattcaca 
accattctcg 
tgtcagttgt 
aagatatttt 
ctgggtttga 
ctttgataaa 
tgaaaaaatt 
gaatcaagtt 
gagtactcca 
atagtgccaa 
acccagagac 
agaacatcag 
atgtccaaac 
ttccaggtgc 
attttaagca 
tggaagagca 



tttggttctt 
agttataaca 
atttgcaaac 
tcagtcacgc 
accgaccgaa 
agaagatact 
tgccattacc 
tgccacaagt 
gcacaacgtt 
tgttgcaact 
tgacgtgttt 
gattgaaagt 
attaatgggt 
atggaaaaag 
tttacagttt 
ttatattaaa 
aaaagggtta 
ccagttgttg 
tggtggatac 
gaaaactctt 
aacagaaggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1299 



<210> 7 
<211> 432 
<212> PRT 

<213> Candida albicans 
<400> 7 

Met Ser Lys Ala Phe Ser Ala Pro Gly Lys Ala Phe Leu Ala Gly Gly 
15 10 15 



Tyr Leu Val Leu Glu Pro lie Tyr Asp Ala Tyr Val Thr Ala Leu Ser 
20 25 30 



Ser Arg Met His Ala Val lie Thr Pro Lys Gly Thr Ser Leu Lys Glu 
35 40 45 



3 



.1. 0 QJ& «9 O fe S , Oc IS: IO S 



Ser Arg lie Lys He Ser Ser Pro Gin Phe Ala Asn Gly Glu Trp Glu 
50 55 60 

Tyr His He Ser Ser Asn Thr Glu Lys Pro Arg Glu Val Gin Ser Arg 
65 70 75 80 

He Asn Pro Phe Leu Glu Ala Thr He Phe He Val Leu Ala Tyr He 
85 90 95 

Gin Pro Thr Glu Ala Phe Asp Leu Glu He He He Tyr Ser Asp Pro 
100 105 ' HO 

Gly Tyr His Ser Gin Glu Asp Thr Glu Thr Lys Thr Ser Ser Asn Gly 
H5 120 125 

Glu Lys Thr Phe Leu Tyr His Ser Arg Ala He Thr Glu Val Glu Lys 
130 135 140 

Thr Gly Leu Gly Ser Ser Ala Gly Leu Val Ser Val Val Ala Thr Ser 
145 150 155 160 

Leu Leu Ser His Phe He Pro Asn Val He Ser Thr Asn Lys Asp He 
165 170 ' 175 

Leu His Asn Val Ala Gin He Ala His Cys Tyr Ala Gin Lys Lys He 
180 185 190 

Gly Ser Gly Phe Asp Val Ala Thr Ala He Tyr Gly Leu He Val Tyr 
195 200 ^ 205 

Arg Arg Phe Gin Pro Ala Leu He Asn Asp Val Phe Gin Val Leu Glu 
210 215 220 

Ser Asp Pro Glu Lys Phe Pro Thr Glu Leu Lys Lys Leu He Glu Ser 
225 230 235 240 

Asn Trp Glu Glu Lys His Glu Arg Cys Thr Leu Pro Tyr Gly He Lys 
245 250 255 

Leu Leu Met Gly Asp Val Lys Gly Gly Ser Glu Thr Pro Lys Leu Val 
260 265 270 

Ser Arg Val Leu Gin Trp Lys Lys Glu Lys Pro Glu Glu Ser Ser Val 
275 280 285 

Val Tyr Asp Gin Leu Asn Ser Ala Asn Leu Gin Phe Met Lys Glu Leu 
290 295 300 

Arg Glu Met Arg Glu Lys Tyr Asp Ser Asp Pro Glu Thr Tyr He Lys 
305 310 315 320 

Glu Leu Asp His Ser Val Glu Pro Leu Thr Val Ala He Lys Asn He 
325 330 335 

Arg Lys Gly Leu Gin Ala Leu Thr Gin Lys Ser Glu Val Pro He Glu 
340 345 350 



4 
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Pro Asp Val Gin 
355 

Cys Val Gly Gly 
370 

Val Leu Val Leu 
385 

Glu Asn Pro Asp 



Gin Thr Glu Gly 
420 



Thr Gin Leu Leu 
360 

Val Val Pro Gly 
375 

Glu Asn Gin Val 
390 

Tyr Phe His Asn 
405 

Val Leu Glu Glu 



Asp Arg Cys Gin 



Ala Gly Gly Tyr 
380 

Gly Asn Phe Lys 
395 

Val Tyr Trp Val 
410 

Lys Pro Glu Asp 
425 



Glu lie Pro Gly 
365 

Asp Ala lie Ala 



Gin Lys Thr Leu 
400 

Asp Leu Glu Glu 
415 

Tyr lie Gly Leu 
430 



<210> 8 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single-stranded oligonucleotide 
<400> 8 

aaatgtcaga gttgagagcc ttcagtgccc cagggaaagc gttactagct gcagctgaag 60 
cttcgtacgc 70 



<210> 9 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single-stranded oligonucleotide 
<400> 9 

agttatttat caagataagt ttccggatct ttttctttcc taacacccca ggcataggcc 60 
actagtggat ctg 73 



<210> 10 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Single-stranded oligonucleotide 
<400> 10 

cccaagcttg gcaatgtcaa aagcatttag tgc 33 



<210> 11 



5 



<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
<400> 11 

ccgctcgaga ttttataaac ctatatagtc ttctgg 



Single-stranded oligonucleotide 

36 



6 
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WO 01/1 4533 PCT/GBOO/03100 



- 1 - 

SEQUENCE LISTING 



<110> AS TRAZ EME CA AB 



5 <;120> PROTEIN 



<130> LDSG/PHM70579/WO 



<140> 
10 <141> 



<150> GB 9919766.7 
<151> 1999-08-21 



15 <160> 11 

<170> Patentln Ver. 2.1 



<210> 1 
20 <211> 547 
<212> DNA 

<213> Candida albicans 



<400> 1 

25 ccaatggaaa aaggaaaagc cagaagaaag 
caatttacag tttatgaagg aattgaggga 
gacttatatt aaagagttag atcattctgt 
cagaaaaggg ttacaagcat taacacaaaa 
aacccagttg ttggaccgtt gtcaagagat 

30 tgctggtgga tacgatgcaa tagctgtatt 
gcagaaaact cttgaaaatc cagattattt 
gcaaacagaa ggtgtacttg aagaaaaacc 
taggatatgt ctacaaggtg atttcgatta 
acttcaa 

35 



ctctgttgtg tatgaccagc ttaatagtgc 60 
aatgcgtgaa aaatacgact cagacccaga 120 
tgagcctttg actgttgcga ttaagaacat 180 
atcagaggtt ccaattgaac ctgatgtcca 240 
ccctggttgt gttggcggtg tggttccagg '300 
agtgttggaa aatcaagtgg gaaattttaa 360 
tcataatgtt tactgggttg atttggaaga 420 
agaagactat ataggtttat aaaatatcac 480 
gattttctgc tacccgtttt aacatatttt 540 

547 



<210> 2 

<211> 21 

<212> DNA 

40 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 



45 <400> 2 

gctggtggat acgatgcaat a 



21 



WO 01/14533 PCT/GBOO/03100 

-2- 

<210> 3 
<211> 577 
<212> DNA 

<213> Candida albicans 

5 

<400> 3 

atgtacatct ttcatgtttg aattcccagt tacttgcaat caattttttc aactctgtgg 60 
ggaacttctc aggatcactt tctagaacct gaaacacgtc atttatcaaa gctggctgaa 120 
atcttctata tacaatcaga ccataaattg cagttgcaac atcaaaccca gatcctatct 180 

10 ttttttgggc ataacaatgt gcaatctgtg caacgttgtg caaaatatct ttattcgtac 240 
tgataacatt ggggataaaa tgggataata aacttgtggc aacaactgac actaatcctg 300 
ccgatgaacc taatccggtc ttttccactt cggtaatggc acgagaatgg taaagaaaag 360 
ttttttctcc attcgaggat gtcttggttt cagtatcttc ttgtgaatga tatccagggt 420 
ccgagtaaat aatgatttca agatcaaatg cttcggtcgg ttgaatataa gctaaaaacc 480 

15 gatggatata gttgcctcta aaaatgggat ttatgcgtga ctgnacttct ttgggttttc 540 
ngtaattgat gatatgtgat antcccattc cccggtc 577 

<210> 4 
20 <211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

25 <223> Description of Artificial Sequence: Single-stranded oligonucleotide 
<400> 4 

ggggataaaa tgggataata aactt 25 

30 

<210> 5 
<211> 1763 
<212> DNA 

<213> Candida albicans 

35 

<400> 5 

gtggaaaaaa aagacagaac agtagattcc aacttcagaa tattcattca gatctgaaca 60 
tttctttttc tccgatcatc aattggcaat gtcaaaagca tttagtgcac ctggaaaagc 120 
atttcttgct ggtggatatt tggttcttga gccaatttat gatgcttatg tgacagcatt 180 

40 gtcatcacga atgcatgcag ttataacacc aaaaggaacc agtttgaaag aatctagaat 240 
caaaatttct tcaccccaat ttgcaaacgg agaatgggaa tatcacatat catcaaatac 300 
agagaagccc agagaagttc agtcacgcat aaatccattt ttagaggcaa ctatattcat 360 
cgttttagct tatattcaac cgaccgaagc atttgatctt gaaatcatca tttactcaga 420 
ccctggatat cattcacaag aagatactga aaccaagaca tcctcgaatg gagaaaaaac 480 

45 att tctttac cattctcgtg ccattaccga agtggaaaag accggattag gttcatcggc 540 
aggattagtg tcagttgttg ccacaagttt attatcccat tttatcccca atgttatcag 600 
tacgaataaa gatattttgc acaacgttgc acagatcgca cattgttatg cccaaaaaaa 660 
gataggatct gggtttgatg ttgcaactgc aatttatggt ctgattgtat atagaagatt 720 
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tcagccagct ttgataaatg acgtgcttca ggttctagaa agtgatcctg agaagttccc 780 
cacagagttg aaaaaattga ttgaaagtaa ctgggaattc aaacatgaaa gatgtacact 84 0 
accatacgga atcaagttat taatgggtga cgtcaagggt ggctcagaaa cacccaaatt 90 0 
ggtatcacga gtactccaat ggaaaaagga aaagccagaa gaaagctctg ttgtgtatga 96 0 

5 ccagcttaat agtgccaatt tacagtetat gaaggaattg agggaaatgc gcgaaaaata 1020 
cgactcagac ccagagactt atattaaaga gctagatcat tctgttgagc ctttgactgt 1080 
tgcgattaag aacatcagaa aagggttaca agcattaaca caaaaatcag aggttccaat 1140 
tgaacctgat gtccaaaccc agttgttgga ccgttgtcaa gagattcctg gttgtgttgg 1200 
tggtgtggtt ccaggtgctg gtggatacga tgcaatagcc gtattagtgt tggaaaatca 1260 

10 agtgggaaat tttaagcaga aaactcttga aaatccagat tattttcata atgtttactg 1320 
ggttgatttg gaagagcaaa cagaaggtgt acttgaagaa aaaccagaag actatatagg 13 80 
tttataaaat atcactggga tatgtctaca aggtgttttc gattagagtt tttgatcccc 1440 
attttaacat attttacttc aatcttacac tttatccttt taagcaggta tgtgtaggga 1500 
aagagcctga tcttcataaa ccgttgcaaa ctaattgatt atattttcta ttgtaaattt 1560 

15 catatgcagg aaatagctta ttcgacaaat tattcatttt cgtctcgttc tggtccaagt 1620 
accccagaga cgaaataact gacaacacgc agggctgggt tggcattttc gtcacacgat 16 80 
tattattaat ggtaacaaaa aaaggggrka tgcccgtggt cgatacacaa atatttatga 1740 

1763 

tatactttcc atattttttt ttt 



20 

<210> 6 
<211> 1299 
<212> DNA 

<213> Candida albicans 

25 

<400> 6 

atgtcaaaag catttagtgc acctggaaaa 
gagccaattt atgatgctta tgtgacagca 
ccaaaaggaa ccagtttgaa agaatctaga 
30 ggagaatggg aatatcacat atcatcaaat 
ataaatccat ttttagaggc aactatattc 
gcatttgatc ttgaaatcat catttactca 
gaaaccaaga catcctcgaa tggagaaaaa 
gaagtggaaa agaccggatt aggttcatcg 
35 ttattatccc attttatccc caatgttatc 
gcacagattg cacattgtta tgcccaaaaa 
gcaatttatg gtctgattgt atatagaaga 
caggttctag aaagtgatcc tgagaagttc 
aactgggaat tcaaacatga aagatgtaca 
40 gacgtcaagg gtggctcaga aacacccaaa 
gaaaagccag aagaaagctc tgttgtgtat 
atgaaggaat tgagggaaat gcgtgaaaaa 
gagttagatc attctgttga gcctttgact 
caagcattaa cacaaaaatc agaggttcca 
45 gaccgttgtc aagagattcc tggttgtgtt 
gatgcaatag ctgtattagt gttggaaaat 
gaaaatccag attattttca taatgtttac 
gtacttgaag aaaaaccaga agactatata 



gcatttcttg ctggtggata tttggttctt 60 
ttgtcatcac gaatgcatgc agttataaca 120 
atcaaaattt cttcacccca atttgcaaac 180 
acagagaagc ccagagaagt tcagtcacgc 24 0 
atcgttttag cttatattca accgaccgaa 300 
gaccctggat atcattcaca agaagatact 360 
acatttcttt accattctcg tgccattacc 420 
gcaggattag tgtcagttgt tgccacaagt 480 
agtacgaata aagatatttt gcacaacgtt 540 
aagataggat ctgggtttga tgttgcaact 600 
tttcagccag ctttgataaa tgacgtgttt 660 
cccacagagt tgaaaaaatt gattgaaagt 720 
ttaccatacg gaatcaagtt attaatgggt 780 
ttggtatcac gagtactcca atggaaaaag 840 
gaccagctta atagtgccaa tttacagttt 900 
tacgactcag acccagagac ttatattaaa 960 
gttgcgatta agaacatcag aaaagggtta 1020 
attgaacctg atgtccaaac ccagttgttg 1080 
ggtggtgtgg ttccaggtgc tggtggatac 1140 
caagtgggaa attttaagca gaaaactctt 1200 
tgggttgatt tggaagagca aacagaaggt 1260 
ggtttataa 1299 
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<210> 7 

<211> 432 

5 <212> PRT 

<213> Candida albicans 

<400> 7 

Met Ser Lys Ala Phe Ser Ala Pro Gly Lys Ala Phe Leu Ala Gly Gly 
10 1 5 10 15 

Tyr Leu Val Leu Glu Pro lie Tyr Asp Ala Tyr Val Thr Ala Leu Ser 
20 25 30 

15 Ser Arg Met His Ala Val He Thr Pro Lys Gly Thr Ser Leu Lys Glu 
35 40 45 

Ser Arg He Lys He Ser Ser Pro Gin Phe Ala Asn Gly Glu Trp Glu 
50 55 60 

20 

Tyr His He Ser Ser Asn Thr Glu Lys Pro Arg Glu Val Gin Ser Arg 
65 70 75 80 

He Asn Pro Phe Leu Glu Ala Thr He Phe He Val Leu Ala Tyr lie 
25 85 90 95 

Gin Pro Thr Glu Ala Phe Asp Leu Glu He He He Tyr Ser Asp Pro 
100 105 HO 

30 Gly Tyr His Ser Gin Glu Asp Thr Glu Thr Lys Thr Ser Ser Asn Gly 
115 120 125 

Glu Lys Thr Phe Leu Tyr His Ser Arg Ala He Thr Glu Val Glu Lys 
130 135 140 

35 

Thr Gly Leu Gly Ser Ser Ala Gly Leu Val Ser Val Val Ala Thr Ser 
145 150 155 160 

Leu Leu Ser His Phe He Pro Asn Val He Ser Thr Asn Lys Asp He 
40 165 170 175 

Leu His Asn Val Ala Gin He Ala His Cys Tyr Ala Gin Lys Lys He 
180 185 190 

45 Gly Ser Gly Phe Asp Val Ala Thr Ala He Tyr Gly Leu He Val Tyr 
195 200 205 
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Arg Arg Phe Gin Pro Ala Leu He Asn Asp Val Phe Gin Val Leu Glu 
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210 215 220 

Ser Asp Pro Glu Lys Phe Pro Thr Glu Leu Lys Lys Leu lie Glu Ser 
225 230 235 240 

Asn Trp Glu Glu Lys His Glu Arg Cys Thr Leu Pro Tyr Gly lie Lys 
245 250 255 

Leu Leu Met Gly Asp Val Lys Gly Gly Ser Glu Thr Pro Lys Leu Val 
260 265 270 

Ser Arg Val Leu Gin Trp Lys Lys Glu Lys Pro Glu Glu Ser Ser Val 
275 280 285 

Val Tyr Asp Gin Leu Asn Ser Ala Asn Leu Gin Phe Met Lys Glu Leu 
290 295 300 

Arg Glu Met Arg Glu Lys Tyr Asp Ser Asp Pro Glu Thr Tyr lie Lys 
305 310 315 320 

Glu Leu Asp His Ser Val Glu Pro Leu Thr Val Ala He Lys Asn He 
325 330 335 

Arg Lys Gly Leu Gin Ala Leu Thr Gin Lys Ser Glu Val Pro He Glu 
340 345 350 

Pro Asp Val Gin Thr Gin Leu Leu Asp Arg Cys Gin Glu He Pro Gly 
355 360 365 

Cys Val Gly Gly Val Val Pro Gly Ala Gly Gly Tyr Asp Ala He Ala 
370 375 380 

Val Leu Val Leu Glu Asn Gin Val Gly Asn Phe Lys Gin Lys Thr Leu 
385 390 395 400 

Glu Asn Pro Asp Tyr Phe His Asn Val Tyr Trp Val Asp Leu Glu Glu 
405 410 415 

Gin Thr Glu Gly Val Leu Glu Glu Lys Pro Glu Asp Tyr lie Gly Leu 
420 425 430 



<210> 8 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 
<400> 8 

aaatgtcaga gttgagagcc ttcagtgccc cagggaaagc gttactagct gcagctgaag 60 

70 

cttcgtacgc 

<210> 9 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 
<400> 9 

agttatttat caagataagt ttccggatct ttttctttcc taacacccca ggcataggcc 60 
actagtggat ctg 73 

<210> 10 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 
<400> 10 

cccaagcttg gcaatgtcaa aagcatttag tgc 33 

<210> 11 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 



<400> 11 

ccgctcgaga ttttataaac ctatatagtc ttctgg 



36 
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PROTEIN 



This invention relates to newly identified polynucleotides, polypeptides encoded by 
these polynucleotides, to the production of such polynucleotides and polypeptides, and to the 
5 uses of such polynucleotides and polypeptides. More specifically, the invention relates to the 
phosphomevalonate kinase (PMK) gene (ERG8 gene) from Candida albicans (C. albicans), to 
methods for its expression yielding phosphomevalonate kinase protein, to novel hybrid 
organisms for use in such expression methods, to methods for purification of the protein, to 
methods and tools for diagnosing C albicans infection and to assays for identifying inhibitors 
10 of the enzyme which inhibitors have potential as anti-fungal agents. 

C. albicans is an important human fungal pathogen and the most prominent target 
organism for antifungal research. PMK is an enzyme required for the biosynthesis of isoprene 
subunits that are used as precursors in the synthesis of sterols, dolichols and ubiquinones. As 
PMK is an essential biosynthetic enzyme, inhibitors of PMK should find use as antifungal 
1 5 agents. All species synthesise a protein with PMK activity however, across species the 
enzymes differ considerably in their amino acid sequence. Because of selectivity problems 
(for example fungal versus human) it is extremely important to optimise potential inhibitors 
specifically against the fungal target enzymes (i.e. C. albicans or Aspergillus fumigatus) and 
not against the human enzyme. Such cross-fungal-species inhibitors possess broad specificity. 
20 Alternatively, it may be desirable to use an inhibitor which is more selective, for example, one 
that inhibits C. albicans PMK but not a homologous but non-identical fungal PMK protein 
such as from Saccharomyces cerevisiae (5. cerevisiae). 

In view of the increased incidence of fungal resistance to existing anti-fungal agents, 
and fuelled by the growing number of fungal infections particularly in people with 
25 immunodeficiency disorders, organ transplants and cancer, there is a need for new means of 
identifying potential anti-fungal agents. 

We have now successfully cloned the ERG8 gene from C. albicans (hereinafter 
referred to as ERG8 gene) and determined its full length nucleotide sequence and 
corresponding (PMK) polypeptide sequence (hereinafter referred to as ERG8 protein) as set 
30 out in Figure 1 and SEQ ID No. 7 of this application respectively. The coding DNA sequence 
(SEQ ID NO. 6) of the C. albicans ERG8 gene isolated is 1299 nucleotides in length and the 
corresponding protein sequence is 433 amino acids in length (SEQ ID NO. 7). The protein 
exhibits approximately 45% homology with the corresponding protein from S. cerevisiae and 
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only about 10% homology to that of the human protein equivalent. Homology as used herein, 
takes the definition known to and routinely used by molecular biologists. It refers to the 
sequence identity between two sequences as assessed by best-fit computer alignment analysis 
using suitable software such as Blast, Blast2, NCBI Blast2, WashU Blast2, FastA, Fasta3 and 
5 PILEUP, using a scoring matrix such as Blosum 62. Such software packages endeavour to 
closely approximate the "gold-standard" alignment algorithm of Smith-Waterman. Thus, the 
preferred software/search engine programme for use in assessing the percent identity or 
similarity, i.e how two primary polypeptide sequences line up is Smith- Waterman. Identity 
refers to direct matches, similarity allows for conservative substitutions. 
1 0 According to a first aspect of the invention there is provided an isolated or purified 

polypeptide which is ERG8 protein, as well as variants thereof. The preferred polypeptide 
sequence is that as set out in SEQ ID NO. 7. The complete C. albicans phosphomevalonate 
kinase enzyme polypeptide has the amino acid sequence as depicted in SEQ ID No. 7 herein. 
The polypeptides of the present invention include the polypeptide of SEQ ID No. 7 as well as 
15 polypeptides which have in increasing order of preference, at least 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, and 99% identity to the polypeptide whose amino acid sequence is 
depicted in SEQ ID NO. 7. 

As used herein, the term "isolated" refers to molecules, either nucleic acid or amino 
acid sequences, that are removed from their natural environment and purified or separated 
20 from at least one other component with which they are naturally associated. Also 

encompassed by this term are molecules that are artificially synthesised and purified away 
from their synthesis materials. Thus, a polynucleotide is said to be isolated when it is 
substantially separated from other contaminant polynucleotides or nucleotides. 

Although the natural polypeptide of SEQ ID NO. 7 and a variant polypeptide may only 
25 possess for example 80% identity, they are actually likely to possess a higher degree of 
similarity, depending on the number of dissimilar codons that are conservative changes. 
Similarity between two sequences includes direct matches as well as conserved amino acid 
substitutes which possess similar structural or chemical properties, e.g. similar charge. 
Examples of conservative changes (conserved amino acid substitutes) are inter alia: alanine to 
30 glycine, isoleucine, valine or leucine; tyrosine to phenylalanine or tryptophan; and lysine to 
arginine or histidine. 

Suitable conservative substitutions of amino acids are known to those of skill in this 
art and may be made without altering the biological activity of the resulting polypeptide, 
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regardless of the chosen method of synthesis. The phrase "conservative substitution" includes 
the use of a chemically derivatized residue in place of a non-derivatized residue provided that 
such polypeptide displays the desired binding activity. D-isomers as well as other known 
derivatives may also be substituted for the naturally occurring amino acids. See, e.g., U.S. 
5 Patent No. 5,652,369, Amino Acid Derivatives, issued July 29, 1997. Substitutions are 
preferably, although not exclusively, made in accordance with those set forth in TABLE 1 as 
follows: 



TABLE 1 



Original residue 


Example conservative substitution 


Ala (A) 


Gly; Ser; Val; Leu; He; Pro 


Arg(R) 1 


Lys; His; Gin; Asn 


Asn (N) 


Gin; His; Lys; Arg 


Asp(D) 


Glu 


Cys(C) 


Ser 


Gln(Q) 


Asn 


Glu (E) 


Asp 


Gly(G) 


Ala; Pro 


His (H) 


Asn; Gin; Arg; Lys 


He (I) 


Leu; Val; Met; Ala; Phe 


Leu (L) 


He; Val; Met; Ala; Phe 


Lys (K) 


Arg; Gin; His; Asn 


Met (M) 


Leu; Tyr; lie; Phe 


Phe (F) 


Met; Leu; Tyr; Val; lie; Ala 


Pro (P) 


Ala; Gly 


Ser (S) 


Thr 


Thr (T) 


Ser 


Trp(W) 


Tyr; Phe 


Tyr(Y) 


Trp; Phe; Thr; Ser 


Val (V) 


He; Leu; Met; Phe; Ala 



1 0 The nucleotide sequences of the present invention may also be engineered in order to 

alter a coding sequence for a variety of reasons, including but not limited to, alterations which 
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modify the cloning, processing and/or expression of the gene product. For example, 
mutations may be introduced using techniques which are well known in the art, eg, site- 
directed mutagenesis to insert new restriction sites, to alter glycosylation patterns, to change 
codon preference, etc. 

5 Included within the scope of the present invention are alleles of the ERG8 molecule of 

the present invention. As used herein, an "allele" or "allelic sequence" is an alternative form 
of the kinase molecule described herein. Alleles result from nucleic acid mutations and 
mRNA splice-variants which produce polypeptides whose structure or function may or may 
not be altered. Any given gene may have none, one or many allelic forms. Common 
10 mutational changes which give rise to alleles are generally ascribed to natural deletions, 

additions or substitutions of amino acids. Each of these types of changes may occur alone, or 
in combination with the others, one or more times in a given sequence. 

Thus, according to a preferred embodiment there is provided an isolated polypeptide 
comprising the sequence depicted in SEQ ID No. 7 or a sequence possessing at least 80% 
1 5 similarity thereto. More preferred embodiments are those that have in increasing order of 
preference at least 85, 90, 95, 96, 97, 98 and 99% similarity to the sequence depicted in SEQ 
ID No. 7. Functional biologically active variants are preferred. 

Fragments of such polypeptides comprising at least 15, preferably at least 30 and more 
preferably at least 50 contiguous amino acids are also encompassed by the present invention. 
20 Such fragments may be used as intermediates to generate longer polypeptide fragments 

including preferably, the full-length polypeptide sequence as depicted in SEQ ID No. 7, or a 
functional variant thereof. Such polypeptide fragments may also be used to raise antibodies 
against or specific for parts of the ERG8 protein. 

The invention also relates to variant polypeptide sequences encoded by nucleic acid 
25 capable of hybridising with nucleic acid coding for the natural polypeptide (SEQ ID No. 6, or 
its complementary antisense strand)(or would do so but for the degeneracy of the genetic 
code), for example under stringent conditions (such as at 35°C to 65°C in a salt solution of 
approximately 0.9M). Such hybridisable polynucleotides are also part of the invention. The 
present invention particularly relates to polynucleotides which hybridise to the ERG8 
30 polynucleotide sequence depicted in SEQ ID NO. 6, its complementary sequence, or fragment 
thereof, under stringent conditions. As used herein, stringent conditions are those conditions 
which enable sequences that possess at least 80%, preferably at least 90% and more preferably 
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at least 95% sequence identity to hybridise together. Thus, nucleic acids which can selectively 
hybridise to the nucleic acid of SEQ ID No. 6, or the complementary antisense strand thereof, 
include nucleic acids which have at least 80%, preferably at least 90%, more preferably at 
least 95%, still more preferably at least 98% sequence identity and most preferably 100%, 
5 over at least a portion of the nucleic acid encoding the ERG8 gene disclosed herein. 

Selectively hybridise means that the molecule must be capable of specifically hybridising to 
the nucleic acid sequence of SEQ ID No. 6 or its complement, to the exclusion of other 
naturally occurring sequences. As well as full-length gene sequences, smaller nucleic acid 
fragments for example oligonucleotide primers which can be used to amplify the ERG8 gene 
1 0 using any of the well known amplification systems such as polymerase chain reaction (PCR), 
or fragments that can be used as diagnostic probes to identify corresponding nucleic acid 
sequences are also part of this invention. The invention thus includes polynucleotides of 
shorter length than the full length ERG8 gene sequence depicted in SEQ ID No. 6, that are 
capable of specifically hybridising to the nucleic acid encoding the C. albicans ERG8 gene 
15 described herein. Such polynucleotides may be at least 10 nucleotides in length, preferably at 
least 15, more preferably at least 20 and most preferably at least 30 nucleotides in length and 
maybe of any size up to and including the full length ERG8 nucleotide sequence. The 
presence of mismatch nucleotides in the hybridisation polynucleotides is not detrimental to the 
utility of such polynucleotides provided that they are capable of selectively hybridising to the 
20 target ERG8 nucleotide sequence. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on 
a nylon membrane and the probe nucleic acid is greater than 500 bases or base pairs is: 6 x 
SSC (saline sodium citrate), 0.5% SDS (sodium dodecyl sulphate), 100^g/ml denatured, 
sonicated salmon sperm DNA. The hybridisation being performed at 68°C for at least 1 hour 
25 and the filters then washed at 68°C in 1 x SSC, or for higher stringency, 0.1 x SSC/0.1% 
SDS. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on 
a nylon membrane and the probe is an oligonucleotide of between 12 and 50 bases is: 3M 
trimethylammonium chloride (TMAC1), 0.01M sodium phosphate (pH 6.8), ImM EDTA (pH 
30 7.6) , 0.5% SDS,100^ig/ml denatured, sonicated salmon sperm DNA and 0.1 dried skimmed 
milk. The optimal hybridisation temperature (Tm) is usually chosen to be 5°C below the Ti of 
the hybrid chain. Ti is the irreversible melting temperature of the hybrid formed between the 
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probe and its target. If there are any mismatches between the probe and the target, the Tm will 
be lower. As a general guide, the recommended hybridisation temperature for 1 7-mers in 3M 
TMAC1 is 48-50°C; for 19-mers,it is 55-57°C; and for 20-mers, it is 58-66°C. 

A suitable hybridisation protocol is described in Example 5 herein, however, operable 
5 variations to this method will be apparent to the person skilled in the art. 

As used herein, the term 'variant' includes naturally occurring allelic variants as well 
as non-naturally occurring variants, fragments and analogs of the sequences depicted in SEQ 
ID NOs. 6 or 7. Such variants include C- or N-truncated variants, deletion variants, 
substitution variants as well as addition and insertion variants. The term 'analog' refers to 
1 0 proproteins which can be activated by cleavage of the proprotein portion to release the 
biologically active polypeptide or protein. The term 'derivative' refers to a polypeptide 
encoded by a chemically modified ERG8 gene, for example one wherein hydrogen has been 
replaced by an acyl or amino group, as well as polypeptides possessing one or more non- 
natural amino acids. When referring to a polypeptide or protein sequence, a functional variant 
1 5 is one that has retained at least some PMK enzymatic activity. The variant polypeptides of the 
present invention may comprise internal, but preferably, terminal flanking sequences (fusion 
proteins) to facilitate protein purification. Such 'additional domain' sequences (Flag 
sequences) may comprise for example, metal chelating peptides such as histidine-tryptophan 
modules (including 6-his tags) that allow purification of the polypeptide on immobilised 
20 metals, protein A domains that allow purification on immobilised immunoglobulin, or peptide 
domains that allow purification on immobilised antibodies specific for the peptide. Other 
suitable 'additional purification domains' will be known to the person skilled in the art. 

According to a preferred embodiment of the invention the native ERG8 polypeptide 
sequence (having the sequence as depicted in SEQ ID No. 7) is fused at its amino terminus to 
25 six histidine residues which serve to enable the polypeptide, once expressed from the host 
cell, to be isolated and purified by affinity chromatography using a Ni-chelate resin. 

A flanking purification domain may be separated from the ERG8 polypeptide by a 
cleavage sequence such as that recognised by thrombin or Factor Xa so as to facilitate release 
of the polypeptide from the flanking sequence which may or may not be attached to an 
30 immobilised support. Alternatively, cyanogen bromide which cleaves at methionine residues 
can be employed to release the desired polypeptide from its flanking sequence. 
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The polypeptides of the invention can be synthesised chemically. For example, by the 
Merryfield technique (J. Amer. Chem. Soc. 85:2149-2154, 1968). Numerous automated 
polypeptide synthesisers, such as Applied Biosystems 431 A Peptide Synthesizer also now 
exist. Alternatively, and preferably, the polypeptides of the invention are produced from a 
5 nucleotide sequence encoding the polypeptide using recombinant expression technology. 
In a further aspect of the invention there are provided isolated polynucleotides 
(including genomic DNA, genomic RNA, cDNA and mRNA; double stranded as well as +ve 
and -ve strands) which encode the polypeptides of the invention. Single stranded DNA 
molecules of all or part of the ERG8 gene either +ve or -ve strand, find use inter alia, as 
10 hybridisation probes or PCR amplification primers. The sense strand of the complete gene 
sequence of native ERG8 is depicted in Figure 1 (SEQ ID No. 5) hereinafter. It will be 
appreciated that a polynucleotide of the invention may comprise any of the degenerate codes 
for a particular amino acid, including the use of rare codons. Indeed, when producing the 
polypeptide by recombinant expression in heterologous host strains, it may be desirable to 
15 adopt the codon usage (preference) of the host organism (Murray. N.A.R. 17:477-508, 1989). 
Thus, according to a further aspect invention there is provided an isolated 
polynucleotide comprising nucleic acid encoding the amino acid sequence depicted in SEQ ID 
No. 7 or a variant thereof, such as one possessing at least 80% identity thereto. 

The invention further comprises convenient fragments of any one of the above 
20 polynucleotide/nucleic acid sequences. Convenient fragments may be defined by restriction 
endonuclease digests of nucleic acid comprising the ERG8 gene sequence. Such fragments 
are useful inter alia, for expressing short polypeptides fragments of ERG8 protein of the 
invention as well as for use as hybridisation probes. The present invention also provides a 
polynucleotide probe comprising any one of the above sequences or fragments together with a 
25 convenient label or marker, preferably a non-radioactive label or marker. Following 
procedures well known in the art, the probes can be used to identify and isolate not only 
corresponding nucleic acid sequences (i.e C. albicans ERG8 gene sequences) but, if 
sufficiently homologous, can also be used to identify the analogous gene from other organisms 
using techniques well known to the person skilled in the art. Such sequences may be 
30 comprised in libraries, such as genomic or cDNA libraries. The present invention also 

provides RNA transcripts corresponding to any of the above C. albicans ERG8 sequences or 
fragments. RNA transcripts can be used to prepare a polypeptide of the invention by in vitro 
translation techniques according to known methods (Sambrook et al "Molecular Cloning- A 
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Laboratory Manual, second edition 1989"). The invention further comprises full-length or 
fragment lengths of ERG 8 gene (coding sequence) flanked by non-coding sequence which 
may include natural or non-natural sequence containing restriction enzyme recognition 
sequence motifs. The incorporation of suitable restriction enzyme recognition sites either side 
5 of the ERG8 coding region, or indeed any polynucleotide sequence from ERG8, facilitates 
cloning of the ERG8 gene or polynucleotide sequence into a suitable vector. A suitable 
polynucleotide comprises a full length C. albicans ERG8 gene (encoding the polypeptide that 
starts with methionine at position 1 and terminates with the leucine that precedes the stop 
codon TAA at position 1299 of Figure 1) flanked by unique Hindm (5'-end)-XhoI (3' -end) 
10 restriction sites. Examples of oligonucleotide primers which are suitable for use in PCR 
amplification of ERGS, and which incorporate useful restriction enzyme sites to facilitate 
cloning, are disclosed as SEQ ID Nos. 1 0 and 1 1 . Nucleotide changes or mutations may be 
introduced into a polynucleotide sequence by de novo polynucleotide synthesis, by site 
directed mutagenesis using appropriately designed oligonucleotide primers or by any other 
1 5 convenient means know to the person skilled in the art. 

For expression purposes, it may be advantageous to engineer a restriction site at the 5'- 
end which is also capable of reconstituting the native amino-terminal methionine of the 
protein. The cleavage recognition sequence for the Ncol restriction enzyme not only includes 
a sequence that codes for methionine, but also one that is capable of retaining a functional 
20 Kozak consensus sequence, enabling the ERG8 gene to be cloned at the 3 '-end of a suitable 
promoter element in an expression vector. 

The polynucleotides can be synthesised chemically, or isolated by one of several 
approaches known to the person skilled in the art such as polymerase chain reaction (PCR) or 
ligase chain reaction (LCR) or by cloning from a genomic or cDNA library. 
25 Once isolated or synthesised, a variety of expression vector/host systems may be used 

to express ERG8 coding sequences. These include, but are not limited to microorganisms 
such as bacteria expressed with plasmids, cosmids or bacteriophage; yeasts tranformed with 
expression vectors; insect cell systems transfected with baculovirus expression systems; plant 
cell systems transfected with plant virus expression systems, such as cauliflower mosaic virus; 
30 or mammalian cell systems (for example those transfected with adenoviral vectors); selection 
of the most appropriate system is a matter of choice. 

Expression vectors usually include an origin of replication, a promoter, a translation 
initiation site, optionally a signal peptide, a polyadenylation site, and a transcription 
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termination site. These vectors also usually contain one or more antibiotic resistance marker 
gene(s) for selection. As noted above, suitable expression vectors may be plasmids, cosmids 
or viruses such as phage or retroviruses. The coding sequence of the polypeptide is placed 
under the control of an appropriate promoter, control elements and transcription terminator so 
5 that the nucleic acid sequence encoding the polypeptide is transcribed into RNA in the host 
cell transformed or transfected by the expression vector construct. The coding sequence may 
or may not contain a signal peptide or leader sequence for secretion of the polypeptide out of 
the host cell. Expression and purification of the polypeptides of the invention can be easily 
performed using methods well known in the art (for example as described in Sambrook et al. 
1 0 "Molecular Cloning- A Laboratory Manual, second edition 1989"). 

The vectors containing the DNA coding for the ERG8 polypeptides of the invention 
can be introduced (i.e transformed or transfected) into E. coli, S. cerevisiae, Pichia pastoris or 
any other suitable host to facilitate their manipulation (i.e. for mutagenesis, cloning or 
expression). Performance of the invention is neither dependent on nor limited to any 
1 5 particular strain of host cell or vector; those suitable for use in the invention will be apparent 
to, and a matter of choice for, the person skilled in the art. 

Host cells transformed or transfected with a vector containing an ERG8 nucleotide 
sequence may be cultured under conditions suitable for the expression and recovery of the 
encoded proteins from the cell culture. Such expressed proteins/polypeptides may be secreted 
20 into the culture medium or they may be contained intracellular^ depending on the sequences 
used, i.e. whether or not suitable secretion signal sequences were present. 

The full-length native isolated C. albicans ERG8 protein (PMK enzyme) of the present 
invention, or a functional variant thereof, is useful as a target in biochemical assays, 
particularly for use in identifying inhibitors of the enzyme. However, to provide sufficient 
25 enzyme for a biochemical assays (for example, for use in a high throughput screen for enzyme 
inhibitors) the enzyme has to be expressed at high levels and it has to be purified. Two major 
constraints impair ERG8 expression and purification: (i) ERG8 is not expressed at high levels 
from G albicans, and (ii) expression and protein purification methodology is not well 
advanced for G albicans. 
30 We have now been able to overcome these problems by controlled over-expression of 

the C albicans ERG8 in a strain of Saccharomyces cerevisiae. S. cerevisiae is a model 
system for expression and purification of recombinant proteins. Use of S. cerevisiae to 
express C albicans ERG8 means that transformation, expression and purification 
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methodology used to produce and isolate the ERG8 protein can follow published procedures. 
As stated above, the invention is not limited to use of S. cerevisiae as the host for expression 
of C albicans ERG8. 

According to a further aspect of the invention there is provided a host cell adapted to 
5 express C. albicans ERG8 polypeptide or a variant thereof. The yeast S. cerevisiae is the 
preferred host cell of choice. According to a further aspect of the invention there is provided a 
novel expression system for expression of the C. albicans ERGS gene, which system 
comprises an S. cerevisiae host strain having the C. albicans ERG8 gene in place of the native 
ERG8 gene from S. cerevisiae, whereby the C albicans ERG8 gene is expressed. Preferred S. 

10 cerevisiae strains include JK9-3Daot and its haploid segregants. 

The C. albicans ERG8 gene is preferably over-expressed relative to the expression 
derived from its own promoter. This is conveniently achieved by replacing the C. albicans 
ERG8 promoter by a stronger and preferably inducible promoter such as the S. cerevisiae 
GAL1 promoter, alpha factor or alcohol oxidase (for reviews see Ausubel et al. "Current 

1 5 Protocols in Molecular Biology", John Wiley & Sons, New York.). 

The novel expression system is conveniently prepared by transformation of a 
heterozygous ERG8 deletion strain of a convenient S. cerevisiae host by a suitable plasmid 
comprising the C. albicans ERG8 gene using methods well known in the art (Ito et al. J. 
Bacteriol. 153:163-168,1983; Schiestl and Grietz, Current Genetics 16:339-346,1989). 

20 The plasmid comprising the C. albicans ERG8 represents a further aspect of the 

invention. Particularly suitable plasmids for expression of C. albicans ERG8 in 5. cerevisiae 
include pYES2(Invitrogen) and plasmids derived from pYES2 carrying a native S. cerevisiae 
promoter such as the glyceraldehyde-3-dehydrogenase promoter. 

The heterozygous ERGS deletion strain of a diploid S. cerevisiae host is conveniently 

25 achieved by disruption preferably using an antibiotic resistance cassette such as the kanamycin 
resistance cassette described by Wach et al (Yeast. 10:1793-1808, 1994). 

As described earlier, the C. albicans ERG8 enzyme may be used in biochemical assays 
to identify agents which modulate the activity of the enzyme. The design and implementation 
of such assays will be evident to the biochemist of ordinary skill. The enzyme may be used to 

30 turn over a convenient substrate whilst incorporating/losing a labelled component to define a 
test system. Test compounds are introduced into the test system and measurements made to 
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determine their effect on enzyme activity. Such assays are useful to identify inhibitors of the 
enzyme which may then prove valuable as antifungal agents. 

Thus, in a further aspect of the invention we provide the use of a G albicans ERG-8 
gene and/or C. albicans PMK enzyme in an assay to identify inhibitors of the enzyme. In 
5 particular, we provide their use in pharmaceutical or agrochemical research. 

Thus, according to a further aspect of the invention there is provided a method of 
identifying compounds that modulate, preferably inhibit, the activity of phosphomevalonate 
kinase (PMK), comprising, contacting a test compound with a polypeptide of the invention 
and determining the effect that the test compound has on the activity of the polypeptide. 
10 The PMK (ERG8) protein catalyses the conversion of phosphomevalonate + ATP to 

pyrophosphomevalonate + ADP. By way of non-limiting example, the activity of the ERG8 
enzyme may be determined by (i) measuring the increase in ADP production, (ii) by following 
the loss of ATP, or (iii) by monitoring transfer of radioactive label (i.e H 3 , C 14 , P 32 ) into 
phosphomevalonate. 

1 5 A suitable assay that measures ADP production involves coupling the ADP produced 

by the action of PMK on phosphomevalonate + ATP substrate with pyruvate kinase and 
phosphoenolpyruvate to form pyruvate and ATP. The pyruvate is then reduced to lactate with 
lactate dehydrogenase which converts NADH to NAD. The production of NAD (directly 
linked to ADP production indicative of PMK action) is conveniently measured by detecting 

20 the change in absorbance at 340nm (NADH oxidation product). In this assay, test compounds 
that inhibit PMK activity are identified by determining the ability of a compound to inhibit 
PMK activity as assessed by a reduction in ADP production as gauged by a reduction in the 
production of NAD from NADH using pyruvate kinase and lactate dehydrogenase as coupling 
enzymes as described above. The person skilled in the art would be able to develop other 

25 assays for measuring PMK activity without inventive input. 

ATP can be conveniently assayed using commercially available kits (i.e Boehringer 
Mannheim) to monitor luminescence resulting from oxidation of luciferin to luciferase (Ford 
et al. J. Biolumin. Chemilumin. 11:149-167, 1996). 

A suitable reaction that measures the production of radioactively labelled 

30 phosphomevalonate involves incubation PMK enzyme with cofactors, substrate ATP and 
phosphomevalonate, one of which carries a radioactive label. After reaction, 
pyrophosphomevalonate can be resolved from unreacted substrate by high voltage 
electrophoresis at pH3.5 on 3MM paper and the amount of radioactivity incorporated into 



WO 01/14533 PCT/GB00/03100 

-12- 

pyrophosphomevalonate can be measured by scintillation counting (Lee andO' Sullivan. J. 
Biol. Chem. 260:13909-13915, 1985). 

Any convenient test compound or library of test compounds may be used in 
conjunction with the test assay. Particular test compounds include low molecular weight 
5 chemical compounds (preferably with a molecular weight less than 1500 daltons) suitable as 
pharmaceutical or veterinary agents for human or animal use, or compounds for non- 
administered use such as cleaning/sterilising agents or for agricultural use. 

The ERG8 enzyme of the invention, and convenient fragments thereof may be used to 
raise antibodies. Such antibodies have a number of uses which will be evident to the 
10 molecular biologist or immunologist of ordinary skill. Such uses include, but are not limited 
to, monitoring enzyme expression, development of assays to measure enzyme activity, 
precipitation or purification of the enzyme and as a diagnostic tool to detect C. albicans. 
Enzyme linked immunosorbant assays (ELIS As) are well known in the art and would be 
particularly suitable for detecting the ERG8 polypeptide or fragments thereof. Antibodies 
1 5 raised against the polypeptides of the invention may be polyclonal, obtained for example by 
injecting the polypeptide(s) into a selected mammal (i.e. rabbit, mouse, goat or horse), and 
later collecting the immunised serum from the animal, and treating this according to 
procedures known in the art. Depending on the host species, various adjuvants may be used to 
enhance the immunological response against the injected polypeptide. Suitable adjuvants 
20 include, but are not limited to Freud's, aluminium hydroxide and SAF. Antibodies may also 
be monoclonal antibodies produced by hybridoma cells, phage display libraries or other 
methodology. Monoclonal antibodies may be inter alia, human, rat or mouse derived. For the 
production of human monoclonal antibodies, hybridoma cells may be prepared by fusing 
spleen cells from an immunised animal, e.g. a mouse, with a tumour cell. Appropriately 
25 secreting hybridoma cells may thereafter be selected (Koehler & Milstein. Nature. 256:495- 
497, 1975; Cole et al. "Monoclonal antibodies and Cancer Therapy, Alan R Liss Inc, New 
York N.Y. pp 77-96). Rodent antibodies may be humanised using recombinant DNA 
technology according to techniques known in the art. Alternatively, chimeric antibodies, 
single chain antibodies, Fab fragments may also be developed against the polypeptides of the 
30 invention (Huse et al. Science. 256:1275-1281, 1989), using skills known in the art. 

The polynucleotides and antibodies of the invention may be used in gene-probe or 
protein-probe methodologies, with or without amplification (for example, via PCR or second 
antibody detection) to detect or diagnose the presence of C albicans. This is particularly 
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valuable in diagnosing clinical infections. Accordingly, the invention provides diagnostic kits 
for the detection of C. albicans ERG8 or fragments thereof, and provides for the use of ERG8 
protein, polypeptide fragments thereof and/or antibodies raised thereagainst as positive 
control. The reagents in the kit may be compartmentalised and the kit may also comprise 

5 instructions for use. 

DNA diagnostics is based on DNA/RNA hybridisation technology, i.e. the specific in 
vitro binding of complementary single-stranded nucleic acid with the formation of double- 
stranded nucleic acid. The DNA/DNA or DNA/RNA double strands formed are termed 
hybrids. To detect the presence of C. albicans in a bodily fluid such as blood, total nucleic 

10 acid is isolated from the test fluid sample using standard techniques and the presence of C 
sdbicans ERG8 nucleic acid in the sample is detected using for example detectably labelled 
probes comprising one or more of the polynucleotides of the invention. The probes can be 
short, chemically synthesised oligonucleotide probes of a length of approximately 10-50 
nucleotides, or may be recombinantly expressed fragments of the ERG8 gene of 

1 5 approximately 0.3-1 .5Kb in size. Single stranded oligonucleotide probes which are specific 
for C. albicans are preferred. The probe can be provided with a suitable detectable reporter 
molecule label such as a radioisotope (P 32 , tritium, C 14 or S 35 ), or a non-radioactive label such 
as digoxigenin or biotin, using techniques available to the person skilled in the art. Prior to the 
hybridisation reaction, all or any part of C. albicans ERG8 DNA containing the sequence to 

20 which the probe can hybridise, present in the test sample is amplified using for example PCR 
(polymerase chain reaction) or LCR (ligase chain reaction). For the specific hybridisation 
reaction, the test nucleic acid and if necessary the probe DNA is converted into single strands 
by denaturation (heat or alkali) and then very specifically hybridised with each other under 
stringent conditions. Under appropriate conditions the gene probe only hybridises to 

25 complementary sequences of the DNA or RNA to be detected. The hybridisation and 

detection assay can be carried out in a number of different formats known to the person skilled 
in the art including, solid-phase hybridisation of target DNA or probe coupled to a solid 
support such as nitrocellulose or magnetic beads. The hybridisation complex can then be 
determined quantitatively, following removal of unbound probe or test nucleic acid, by way of 

30 the reporter molecule label (e.g. fluorescent or radioactive) employed. 

The test sensitivity of this single gene-probe diagnostic method can be increased by 
combination with DNA or RNA amplification techniques such as PCR or LCR. Using such 
amplification techniques, the DNA to be detected can be multipled by up to 10 9 . 
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There may only be 100-1000 organisms per ml of blood in association with Candida 
infections. Such small numbers of cells are easily detectable when combining the 
amplification and DNA-probe detection techniques offering the possibility of early detection 
of infection. 

5 Thus, according to a further aspect of the invention there is provided a method of 

diagnosing the presence of the C. albicans ERG8 gene in a test sample, comprising: contacting 
a polynucleotide probe of at least 15 nucleotides in length, which probe is capable of 
specifically hybridising with the sequence depicted in SEQ ID No. 6, with the test sample 
under conditions which allow duplex formation between said polynucleotide probe and the 

10 nucleic acid in the test sample; and, detecting duplex formation. In a preferred embodiment 
the polynucleotide probe is detectably labelled. In another embodiment the polynucleotide 
probe is single stranded. In another embodiment the polynucleotide probe is completely 
complementary to the target sequence to be detected. According to a further aspect of the 
invention the polynucleotide probe is substituted for by a pair of oligonucleotide primers 

1 5 capable of specific PCR amplification of all or part of the ERG8 gene in the test sample, with 
subsequent identification of amplification product. 

According to another aspect of the present invention there is provided a diagnostic kit 
for diagnosing or detecting the presence of C. albicans comprising, one or more diagnostic 
probe(s) and/or diagnostic primer(s) and/or antibodies capable of selectively hybridising or 

20 binding to the polynucleotide of SEQ ID No. 6 or the polypeptide of SEQ ID No. 7, or to 
variant sequences thereof as defined herein. 

In a preferred embodiment, the diagnostic (detection) probes are provided on a 
microarray. 

Such kits may further comprise appropriate buffer(s) and/or polymerase(s) such as 
25 thermostable polymerases, for example taq polymerase. They may also comprise 

companion/constant primers and/or control primers or probes. A companion/constant primer 
is one that is part of the pair of primers used to perform PCR. Such primer usually 
complements the template strand precisely. 

In another embodiment the kit is an ELISA kit comprising one or more antibodies 
30 specific for the polypeptide depicted in SEQ ID No. 7, or a variant thereof as defined herein. 

The following examples and figure describe and illustrate the invention. They are not 
intended to limit the scope of the invention in any way: 
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Figure 1 shows the nucleotide sequence of the C albicans gene encoding 
phosphomevalonate kinase. Translation start (ATG) and stop (TAA) codons are highlighted. 

Examples 

5 

1. Cloning and partial sequence determination of two separate clones from a Candida 
albicans genomic library. 

Two separate cloned and sequenced nucleic acid sequences from a C. albicans library 
(SEQ ID NOs. 1 & 3) were found to have homology to that of S. cerevisiae ERG 8 gene. The 
10 complement of specific regions in SEQ ID Nos. 1 and 3 were synthesised as oligonucleotides 
(SEQ ID Nos. 2 & 4) for use in the isolation of a clone containing the C. albicans ERG8 gene. 

2. Cloning and sequence determination of Candida albicans ERG8. 

Using the two oligonucleotide primers (SEQ ID Nos. 2 and 4), the C. albicans ERG8 

15 gene was isolated as a plasmid clone from a library of C. albicans genomic DNA in the yeast 
shuttle vector YEp24 using PCR. The C albicans library was maintained in E. coli and 
independent bacterial colonies were grown in single wells of each of 15 x 384-well microtitre 
plates. The properties of the library plasmids are such that this gridded array contains 
approximately 2.5x the amount of DNA in the C. albicans genome. 

20 Small aliquots of cells from each of the wells were mixed to produce a pool of cells 

that were derived from all of the wells from a single plate. Similar pools were made for all of 
the rows and all of the columns from each of the plates. Samples of each of the pools of the 
cells for each complete plate were used in PCR reactions with SEQ ID Nos. 2 and 4 
oligonucleotide primers to identify plate(s) in the array carrying C. albicans ERG8. 

25 Subsequent PCR reactions with pools of cells from rows of wells and columns of wells 
defined the specific well(s) carrying a clone of C. albicans ERG8. 

The PCR reactions contained in a total volume of 0.05ml: 75mM Tris-HCl (pH 8.8 at 
25°C), 20mM (NH4) 2 S0 4 , 1.5mM MgCl 2 , 0.01% Tween 20, 0.2mM of each of dATP, dCTP, 
dGTP and dTTP, 1.25 units Taq DNA polymerase, lOOpmoles of each oligonucleotide primer 

30 and 0.005ml E> coli cell suspension. PCR reactions were incubated at 94°C for 1 min then for 
30 cycles of the following: 94°C for 1 min, 55°C for 1 min, 72°C for 1 min. PCR products 
were analysed by electrophoresis through agarose and visualised under UV light after staining 
with ethidium bromide. 
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Putative clones harbouring the ERG8 gene were selected, the plasmid DNAs in these 
clones were purified and the complete sequence of the C. albicans ERG8 gene was 
determined on both strands using flanking sequence- or insert sequence-specific 
oligonucleotide primers. The full-length of the C. albicans ERG8 gene, including 'start' ATG 
5 and 'stop' TAA is shown in Figure 1. The protein translation of the gene is depicted in SEQ 
ID No. 7. 

3. Generation of a heterozygous ERG8 deletion strain of S. cerevisiae 

Since PMK is an essential enzyme, only one allele of a diploid cell can be deleted 
10 without loss of viability. One ERG8 gene diploid strain of S. cerevisiae (JK9-3daa; Kunz et 
al., Cell 73:585-596 (1993)) was disrupted using a kanamycin resistance cassette as described 
by Wach et al. (Yeast 10:1793-1808, 1994) using the protocol described therein with the 
oligonucleotides shown in SEQ ED Nos. 8 and 9. Sporulation of the heterozygous diploid 
(ERG8/erg8::KanMX) yields only two viable spores that are both sensitive to kanamycin, 
15 showing ERG8 to be essential, and the characteristic arrest phenotype for the two inviable 
spores. 

4. Complementation of a S. cerevisiae ERG8 deletion with the cloned C. albicans ERG8 

The heterozygous ERG8/erg8::KanMX strain was transformed with the plasmid 
20 carrying the full-length C albicans ERG8 gene within a fragment of C. albicans genomic 
DNA such that expression of the gene will depend on functionality of the C albicans 
promoter in the heterologous S. cerevisiae host. Surprisingly, the gene carried on the plasmid 
failed to complement the gene deletion as demonstrated by a failure to recover kanamycin- 
resistant haploid cells after sporulation. This was probably due to inappropriate expression of 
25 C albicans ERG8 in S. cerevisiae. 

To enable expression of C. albicans ERG8 in S. cerevisiae and to facilitate 
purification of ERG8 protein as a result of over-expression in a suitable host, the C. albicans 
promoter was replaced by the efficient, inducible S. cerevisiae GAL1 promoter. The C 
albicans ERG8 coding sequence was amplified by PCR using the oligonucleotides shown in 
30 SEQ ID Nos. 10 and 11, which contain convenient restriction enzyme sites for cloning the 
product of PCR into an appropriate expression vector such as pYES2 (Invitrogen). The 
identity of the PCR-amplified gene cloned into pYES2 was confirmed by DNA sequencing. 
After transformation into the heterozygous ERG8/erg8::KanMX strain, the plasmid was able 
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to complement the erg8::KanMX allele in S. cerevisiae since kanamycin-resistant haploid 
spores were viable on medium containing galactose but not glucose. This S. cerevisiae strain 
is a useful source of biologically active C. albicans ERG8 protein for assays in vitro. 

C. albicans ERG8 can also be conveniently over-expressed in bacteria such as E. coli. 
5 The C. albicans ERG8 coding sequence is amplified by PCR using oligonucleotides 

containing convenient restriction sites for cloning into expression vectors such as pT7#3.3. It 
is particularly convenient if the initiation codon for ERG8 is incorporated within one of the 
restriction sites. Oligonucleotides suitable for this are shown in SEQ ID Nos. 12 and 13. 
Oligonucleotides may also incorporate extra sequences to encode a small "tag" that aids the 

1 0 subsequent purification of the protein. Such tags include for example, the "His6" tags which 
may be incorporated at the N- or C-terminus of ERG 8 using the oligonucleotides shown in 
SEQ ID Nos. 14 and 15. Recombinantly expressed tagged ERG8 protein can be conveniently 
purified by affinity chromatography purification methodology using commercially available 
purification kits (i.e Qiagen) (Borsig et ah, Biochem. Biophys. Res. Commun. 240:586-589, 

15 1997). 

5. Hybridisation test of nucleic acid variations of specific nucleic acid sequences 

5.1 Hybridisation Test 

A method for detecting variant nucleic acids containing sequences related to specific 
20 ERG8 sequences such as natural alleles, is described. These variant nucleic acids may be 
present in a variety of forms such as within plasmids or other like vehicles which may be 
fixed on to a hybridisation membrane, such as a nitrocellulose or nylon filter ready for 
detection using a labelled probe. Hybridisation assays can also be performed to identify 
variant sequences from within genomic or cDNA libraries. Hybridisation technology is well 
25 advanced. It will be apparent to the person skilled in the art that the protocol described below 
is only one example of a hybridisation protocol suitable to identify ERG8 variant sequences. 

5.2 Hybridisation probe 

Hybridisation probes may be generated from any fragment of DNA or RNA encoding 
the specific ERG8 nucleic sequence of interest. Such fragments can be for example, restriction 
30 fragments isolated following restriction enzyme digestion of nucleic acid containing the 
ERG8 nucleotide sequence or synthetic oligonucleotides specific for a region of the ERG8 
gene or a complementary sequence thereto. 
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A hybridisation probe can be generated from a synthetic oligonucleotide or a 
dephosphorylated restriction fragment sequence by addition of a radioactive 5' phospate group 

from [y- P] ATP by the action of T4 polynucleotide kinase. 20 pmoles of the 
oligonucleotide are added to a 20^1 reaction containing lOOmM Tris, pH7.5 , lOmM MgCl2, 

5 O.lmM spermidine, 20mM dithiothreitol (DTT), 7.55|iM ATP, 55|iCi [y- 32 P]ATP and 2.5u 
T4 polynucleotide kinase (Pharmacia Biotechnology Ltd, Uppsala, Sweden). The reaction is 

incubated for 30 minutes at 37°C and then for 10 minutes at 70°C prior to use in 
hybridisation. Methods for the generation of hybridisation probes from oligonucleotides or 
from DNA and RNA fragments (Chapters 1 1 and 10 respectively in Sambrook et al. ibid). A 
10 number of proprietary kits are also available for these procedures. 
5.3 Hybridisation conditions 

Filters containing the nucleic acid are pre-hybridised in 1 00ml of a solution 

containing 6x SSC, 0.1%SDS and 0.25% dried skimmed milk (Marvel™) at 65°C for a 
minimum of 1 hour in a suitable enclosed vessel. A proprietary hybridisation apparatus such 
15 as model HB-1 (Techne Ltd) provides reproducible conditions for the experiment. 

The pre-hybridisation solution is then replaced by 10ml of a probe solution 
containing 6xSSC, 0.1% SDS, 0.25% dried skimmed milk (e.g. Marvel™) and the 
oligonucleotide probe generated above. The filters are incubated in this solution for 5 minutes 

at 65°C before allowing the temperature to fall gradually to below 30°C. The probe solution 
20 is then discarded and the filters washed in 100ml 6xSSC, 0.1% SDS at room temperature for 5 

minutes. Further washes are then made in fresh batches of the same solution at 30°C and then 

in 10°C increments up to 60°C for 5 minutes per wash. 

After washing, the filters are dried and used to expose an X-ray film such as 

Hyperfilm™ MP (Amersham International) at -70°C in a light-tight film cassette using a fast 
25 tungstate intensifying screen to enhance the photographic image. The film is exposed for a 
suitable period (normally overnight) before developing to reveal the photographic image of 
the radio-active areas on the filters. Related nucleic acid sequences are identified by the 
presence of a photographic image compared to totally unrelated sequences which should not 
produce an image. Generally, related sequences will appear positive at the highest wash 
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temperature (60 C). However, related sequences may only show positive at the lower wash 

temperatures (50, 40 or 30°C). 

These results will also depend upon the nature of the probe used. Longer nucleic 
acid fragment probes will need to be hybridised for longer periods at high temperature but 
5 may remain bound to related sequences at higher wash temperatures and/or at lower salt 
concentrations. Shorter, mixed or degenerate oligonucleotide probes may require less 

stringent washing conditions such as lower temperatures and/or higher Na + concentrations. A 
discussion of the considerations for hybridisation protocols is provided in Sambrook et al. 
(Chapter 11). 

10 To prepare 20 x SSC,175.3 g of NaCl and 88.2 g of sodium citrate is dissolved in 

approximately 800ml of water, the pH is adjusted to 7.0 using 10 N solution of NaOH and the 
volume is adjusted to 1 litre with water, before autoclaving. 
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Claims: 

1 . A purified polypeptide comprising the amino acid sequence depicted in SEQ ID No. 7 or a 
sequence possessing at least 80% similarity thereto. 

2. An isolated polypeptide of at least 15 contiguous amino acids of the polypeptide of claim 1. 
5 3. An antibody specific for the polypeptide of claim 1 or 2. 

4. An antibody as claimed in claim 3 which is a monoclonal antibody. 

5. A purified polynucleotide comprising a nucleic acid sequence encoding the polypeptide 
depicted in SEQ ID No. 7 or a sequence possessing at least 80% identity thereto. 

6. A polynucleotide of at least 15 nucleotides in length, which polynucleotide is capable of 
1 0 specifically hybridising to a nucleic acid sequence selected from the group consisting of 

SEQ ID Nos. 1, 3, 5 or 6, or a sequence complementary to any of said sequences. 

7. An expression vector comprising the polynucleotide of claim 5. 

8. A host cell which contains an expression vector according to claim 7. 

9. A method for producing the polypeptide of claim 1 , comprising: 

15 (a) culturing a host cell according to claim 8 under conditions suitable for the expression of 
said polypeptide, and 
(b) recovering said polypeptide from the host cell or cell culture. 

10. Use of the polypeptide of claim 1 in an assay to identify compounds that inhibit 
phosphomevalonate kinase (PMK) activity. 

20 1 1 . A method of identifying compounds that modulate the activity of PMK, comprising: 

(a) contacting a test compound with a polypeptide according to claim 1 , and 

(b) determining the effect that the test compound has on the activity of the polypeptide. 

12. A compound identified by the method of claim 1 1 . 

13. A method for detecting or diagnosing the presence of Candida albicans in a test sample, 
25 comprising contacting the sample with an agent capable of detecting a polypeptide 

possessing the amino acid sequence depicted in SEQ ID No. 7 or a sequence possessing at 
least 80% similarity thereto, or a nucleic acid sequence encoding the polypeptide depicted 
in SEQ ID No. 7 or a sequence possessing at least 80% identity thereto. 

14. A method as claimed in claim 13 wherein the presence of the nucleic acid is detected 
30 using an oligonucleotide primer or probe capable of selectively hybridising to the said 

polynucleotide. 

15. A diagnostic kit for detecting the presence of C. albicans comprising: one or more 
diagnostic probe(s) and/or diagnostic primer(s) and/or antibodies capable of selectively 
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hybridising or binding to the polynucleotide of claim 6 or the polypeptide of claim 1, and 
instructions for use. 
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GTGGAAAAAAAAGACAGAACAGTAGATTCCAACTTCAGAATATTCATTCAGATCTGAACATTT 

CTTTTTCTCCGATCATCAATTGGCA ATG TCAAAAGCATTTAGTGCACCTGGAAAAGCATTTCT 

TGCTGGTGGATATTTGGTTCTTGAGCCAATTTATGATGCTTATGTGACAGCATTGTCATCACG 

AATG CAT G C AGTT ATAACAC CAAAAGGAAC C AGTTT GAAAGAATC TAGAAT CAAAATTT CTTC 

AC C C C AATTTG CAAACGGAGAATGGGAATAT CAC ATATCAT CAAATACAGAGAAG C C CAGAGA 

AGTTCAGTCACGCATAAATCCATTTTTAGAGGCAACTATATTCATCGTTTTAGCTTATATTCA 

ACCGACCGAAGCATTTGATCTTGAAATCATCATTTACTCAGACCCTGGATATCATTCACAAGA 

AGATACTGAAACCAAGACATCCTCGAATGGAGAAAAAACATTTCTTTACCATTCTCGTGCCAT 

TACCGAAGTGGAAAAGACCGGATTAGGTTCATCGGCAGGATTAGTGTCAGTTGTTGCCACAAG 

TTTATTATCCCATTTTATCCCCAATGTTATCAGTACGAATAAAGATATTTTGCACAACGTTGC 

ACAGATTGCACATTGTTATGCCCAAAAAAAGATAGGATCTGGGTTTGATGTTGCAACTGCAAT 

TTATGGTCTGATTGTATATAGAAGATTTCAGCCAGCTTTGATAAATGACGTGTTTCAGGTTCT 

AGAAAGTGATCCTGAGAAGTTCCCCACAGAGTTGAAAAAATTGATTGAAAGTAACTGGGAATT 

CAAACATGAAAGATGTACATTACCATACGGAATCAAGTTATTAATGGGTGACGTCAAGGGTGG 

CTCAGAAACACCCAAATTGGTATCACGAGTACTCCAATGGAAAAAGGAAAAGCCAGAAGAAAG 

CTCTGTTGTGTATGACCAGCTTAATAGTGCCAATTTACAGTTTATGAAGGAATTGAGGGAAAT 

GCGTGAAAAATACGACTCAGACCCAGAGACTTATATTAAAGAGTTAGATCATTCTGTTGAGCC 

TTTGACTGTTGCGATTAAGAACATCAGAAAAGGGTTACAAGCATTAACACAAAAATCAGAGGT 

TCCAATTGAACCTGATGTCCAAACCCAGTTGTTGGACCGTTGTCAAGAGATTCCTGGTTGTGT 

TGGTGGTGTGGTTCCAGGTGCTGGTGGATACGATGCAATAGCTGTATTAGTGTTGGAAAATCA 

AGTGGGAAATTTTAAGCAGAAAACTCTTGAAAATCCAGATTATTTTCATAATGTTTACTGGGT 

TGATTTGGAAGAGCAAACAGAAGGTGTACTTGAAGAAAAACCAGAAGACTATATAGGTTTATA 

AAATATCACTGGGATATGTCTACAAGGTGTTTTCGATTAGAGTTTTTGATCCCCATTTTAACA 

TATTTTACTTCAATCTTACACTTTATCCTTTTAAGTAGGTATGTGTAGGGAAAGAGCCTGATC 

TTCATAAACCGTTGCAAACTAATTGATTATATTTTCTATTGTAAATTTCATATGCAGGAAATA 

GCTTATTCGACAAATTATTTATTTTCGTCTCGTTCTGGTCCAAGTACCCCAGAGACGAAATAA 

CTGACAACACGCAGGGCTGGGTTGGCATTTTCGTCACACGATTATTATTAATGGTAACAAAAA 

AAGGGGRKATGCCCGTGGTCGATACACAAATATTTATGATATACTTTCCATATTTTTTTTTT 



FIGURE 1 
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SEQUENCE LISTING 



<110> ASTRAZENECA AB 

5 <12 0> PROTEIN 

<130> LDSG/PHM70579/WO 

<140> 
10 <141> 

<150> GB 9919766.7 
<151> 1999-08-21 

15 <160> 11 

<170> Patentln Ver. 2.1 

<210> 1 
20 <211> 547 
<212> DNA 

<213> Candida albicans 
<400> 1 

25 ccaatggaaa aaggaaaagc cagaagaaag ctctgttgtg tatgaccagc ttaatagtgc 60 
caatttacag tttatgaagg aattgaggga aatgcgtgaa aaatacgact cagacccaga 120 
gacttatatt aaagagttag atcattctgt tgagcctttg actgttgcga ttaagaacat 180 
cagaaaaggg ttacaagcat taacacaaaa atcagaggtt ccaattgaac ctgatgtcca 240 
aacccagttg ttggaccgtt gtcaagagat tcctggttgt gttggcggtg tggttccagg 300 

30 tgctggtgga tacgatgcaa tagctgtatt agtgttggaa aatcaagtgg gaaattttaa 360 
gcagaaaact cttgaaaatc cagattattt tcataatgtt tactgggttg atttggaaga 420 
gcaaacagaa ggtgtacttg aagaaaaacc agaagactat ataggtttat aaaatatcac 480 
taggatatgt ctacaaggtg atttcgatta gattttctgc tacccgtttt aacatatttt 540 
acttcaa 547 

35 

<210> 2 
<211> 21 
<212> DNA 
40 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 
45 <400> 2 

gctggtggat acgatgcaat a 21 
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<210> 3 
<211> 577 
<212> DNA 

<213> Candida albicans 

5 

<400> 3 

atgtacatct ttcatgtttg aattcccagt 
ggaacttctc aggatcactt tctagaacct 
atcttctata tacaatcaga ccataaattg 

10 ttttttgggc ataacaatgt gcaatctgtg 
tgataacatt ggggataaaa tgggataata 
ccgatgaacc taatccggtc ttttccactt 
ttttttctcc attcgaggat gtcttggttt 
ccgagtaaat aatgatttca agatcaaatg 

15 gatggatata gttgcctcta aaaatgggat 
ngtaattgat gatatgtgat antcccattc 



-2- 



tacttgcaat 


caattttttc 


aactctgtgg 


60 


gaaacacgtc 


atttatcaaa 


gctggctgaa 


120 


cagttgcaac 


atcaaaccca 


gatcctatct 


180 


caacgttgtg 


caaaatatct 


ttattcgtac 


240 


aacttgtggc 


aacaactgac 


actaatcctg 


300 


cggtaatggc 


acgagaatgg 


taaagaaaag 


360 


cagtatcttc 


ttgtgaatga 


tatccagggt 


420 


cttcggtcgg 


ttgaatataa 


gctaaaaacc 


480 


ttatgcgtga 


ctgnacttct 


ttgggttttc 


540 


cccggtt 






577 



<210> 4 
20 <211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

25 <223> Description of Artificial Sequence: Single- stranded oligonucleotide 
<400> 4 

ggggataaaa tgggataata aactt 2 5 

30 

<210> 5 
<211> 1763 
<212> DNA 

<213> Candida albicans 

35 

<400> 5 

gtggaaaaaa aagacagaac agtagattcc aacttcagaa tattcattca gatctgaaca 60 
tttctttttc tccgatcatc aattggcaat gtcaaaagca tttagtgcac ctggaaaagc 120 
atttcttgct ggtggatatt tggttcttga gccaatttat gatgcttatg tgacagcatt 180 

40 gtcatcacga atgcatgcag ttataacacc aaaaggaacc agtttgaaag aatctagaat 240 
caaaatttct tcaccccaat ttgcaaacgg agaatgggaa tatcacatat catcaaatac 300 
agagaagccc agagaagttc agtcacgcat aaatccattt ttagaggcaa ctatattcat 360 
cgttttagct tatattcaac cgaccgaagc atttgatctt gaaatcatca tttactcaga 420 
ccctggatat cattcacaag aagatactga aaccaagaca tcctcgaatg gagaaaaaac 4 80 

45 atttctttac cattctcgtg ccattaccga agtggaaaag accggattag gttcatcggc 540 
aggattagtg tcagttgttg ccacaagttt attatcccat tttatcccca atgttatcag 600 
tacgaataaa gatattttgc acaacgttgc acagattgca cattgttatg cccaaaaaaa 660 
gataggatct gggtttgatg ttgcaactgc aatttatggt ctgattgtat atagaagatt 720 
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tcagccagct ttgataaatg acgtgtttca ggttctagaa agtgatcctg agaagttccc 780 

cacagagttg aaaaaattga ttgaaagtaa ctgggaattc aaacatgaaa gatgtacatt 84 0 

accatacgga atcaagttat taatgggtga cgtcaagggt ggctcagaaa cacccaaatt 900 

ggtatcacga gtactccaat ggaaaaagga aaagccagaa gaaagctctg ttgtgtatga 96 0 

5 ccagcttaat agtgccaatt tacagtttat gaaggaattg agggaaatgc gtgaaaaata 1020 

cgactcagac ccagagactt atattaaaga gttagatcat tctgttgagc ctttgactgt 1080 

tgcgattaag aacatcagaa aagggttaca agcattaaca caaaaatcag aggttccaat 114 0 

tgaacctgat gtccaaaccc agttgttgga ccgttgtcaa gagattcctg gttgtgttgg 1200 

tggtgtggtt ccaggtgctg gtggatacga tgcaatagct gtattagtgt tggaaaatca 1260 

10 agtgggaaat tttaagcaga aaactcttga aaatccagat tattttcata atgtttactg 1320 

ggttgatttg gaagagcaaa cagaaggtgt acttgaagaa aaaccagaag actatatagg 13 8 0 

tttataaaat atcactggga tatgtctaca aggtgttttc gattagagtt tttgatcccc 1440 

attttaacat attttacttc aatcttacac tttatccttt taagtaggta tgtgtaggga 1500 

aagagcctga tcttcataaa ccgttgcaaa ctaattgatt atattttcta ttgtaaattt 1560 

15 catatgcagg aaatagctta ttcgacaaat tatttatttt cgtctcgttc tggtccaagt 1620 

accccagaga cgaaataact gacaacacgc agggctgggt tggcattttc gtcacacgat 16 80 

tattattaat ggtaacaaaa aaaggggrka tgcccgtggt cgatacacaa atatttatga 174 0 

tatactttcc atattttttt ttt 1763 

20 

<210> 6 
<211> 1299 
<212> DNA 

<213> Candida albicans 

25 

<400> 6 

atgtcaaaag catttagtgc acctggaaaa gcatttcttg ctggtggata tttggttctt 60 

gagccaattt atgatgctta tgtgacagca ttgtcatcac gaatgcatgc agttataaca 120 

ccaaaaggaa ccagtttgaa agaatctaga atcaaaattt cttcacccca atttgcaaac 180 

30 ggagaatggg aatatcacat atcatcaaat acagagaagc ccagagaagt tcagtcacgc 24 0 

ataaatccat ttttagaggc aactatattc atcgttttag cttatattca accgaccgaa 300 

gcatttgatc ttgaaatcat catttactca gaccctggat atcattcaca agaagatact 36 0 

gaaaccaaga catcctcgaa tggagaaaaa acatttcttt accattctcg tgccattacc 420 

gaagtggaaa agaccggatt aggttcatcg gcaggattag tgtcagttgt tgccacaagt 48 0 

35 ttattatccc attttatccc caatgttatc agtacgaata aagatatttt gcacaacgtt 540 

gcacagattg cacattgtta tgcccaaaaa aagataggat ctgggtttga tgttgcaact 600 

gcaatttatg gtctgattgt atatagaaga tttcagccag ctttgataaa tgacgtgttt 660 

caggttctag aaagtgatcc tgagaagttc cccacagagt tgaaaaaatt gattgaaagt 720 

aactgggaat tcaaacatga aagatgtaca ttaccatacg gaatcaagtt attaatgggt 78 0 

40 gacgtcaagg gtggctcaga aacacccaaa ttggtatcac gagtactcca atggaaaaag 84 0 

gaaaagccag aagaaagctc tgttgtgtat gaccagctta atagtgccaa tttacagttt 900 

atgaaggaat tgagggaaat gcgtgaaaaa tacgactcag acccagagac ttatattaaa 960 

gagttagatc attctgttga gcctttgact gttgcgatta agaacatcag aaaagggtta 1020 

caagcattaa cacaaaaatc agaggttcca attgaacctg atgtccaaac ccagttgttg 1080 

45 gaccgttgtc aagagattcc tggttgtgtt ggtggtgtgg ttccaggtgc tggtggatac 114 0 

gatgcaatag ctgtattagt gttggaaaat caagtgggaa attttaagca gaaaactctt 12 00 

gaaaatccag attattttca taatgtttac tgggttgatt tggaagagca aacagaaggt 1260 

gtacttgaag aaaaaccaga agactatata ggtttataa 1299 
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<210> 7 
<211> 432 
<212> PRT 

<213> Candida albicans 
<400> 7 

Met Ser Lys Ala Phe Ser Ala Pro Gly Lys Ala Phe Leu Ala Gly Gly 
15 10 15 

Tyr Leu Val Leu Glu Pro He Tyr Asp Ala Tyr Val Thr Ala Leu Ser 
20 25 30 

Ser Arg Met His Ala Val He Thr Pro Lys Gly Thr Ser Leu Lys Glu 
35 40 45 

Ser Arg He Lys He Ser Ser Pro Gin Phe Ala Asn Gly Glu Trp Glu 
50 55 60 

Tyr His He Ser Ser Asn Thr Glu Lys Pro Arg Glu Val Gin Ser Arg 
65 70 75 80 

He Asn Pro Phe Leu Glu Ala Thr He Phe He Val Leu Ala Tyr lie 
85 90 95 

Gin Pro Thr Glu Ala Phe Asp Leu Glu He He He Tyr Ser Asp Pro 
100 105 110 

Gly Tyr His Ser Gin Glu Asp Thr Glu Thr Lys Thr Ser Ser Asn Gly 
115 120 125 

Glu Lys Thr Phe Leu Tyr His Ser Arg Ala He Thr Glu Val Glu Lys 
130 135 140 

Thr Gly Leu Gly Ser Ser Ala Gly Leu Val Ser Val Val Ala Thr Ser 
145 150 155 160 

Leu Leu Ser His Phe He Pro Asn Val He Ser Thr Asn Lys Asp lie 
165 170 175 

Leu His Asn Val Ala Gin lie Ala His Cys Tyr Ala Gin Lys Lys lie 
180 185 190 

Gly Ser Gly Phe Asp Val Ala Thr Ala lie Tyr Gly Leu lie Val Tyr 
195 200 205 



Arg Arg Phe Gin Pro Ala Leu lie Asn Asp Val Phe Gin Val Leu Glu 
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210 215 220 

Ser Asp Pro Glu Lys Phe Pro Thr Glu Leu Lys Lys Leu lie Glu Ser 
225 230 235 240 

Asn Trp Glu Glu Lys His Glu Arg Cys Thr Leu Pro Tyr Gly lie Lys 
245 250 255 

Leu Leu Met Gly Asp Val Lys Gly Gly Ser Glu Thr Pro Lys Leu Val 
260 265 270 

Ser Arg Val Leu Gin Trp Lys Lys Glu Lys Pro Glu Glu Ser Ser Val 
275 280 285 

Val Tyr Asp Gin Leu Asn Ser Ala Asn Leu Gin Phe Met Lys Glu Leu 
290 295 300 

Arg Glu Met Arg Glu Lys Tyr Asp Ser Asp Pro Glu Thr Tyr lie Lys 
305 310 315 320 

Glu Leu Asp His Ser Val Glu Pro Leu Thr Val Ala lie Lys Asn lie 
325 330 335 

Arg Lys Gly Leu Gin Ala Leu Thr Gin Lys Ser Glu Val Pro lie Glu 
340 345 350 

Pro Asp Val Gin Thr Gin Leu Leu Asp Arg Cys Gin Glu lie Pro Gly 
355 360 365 

Cys Val Gly Gly Val Val Pro Gly Ala Gly Gly Tyr Asp Ala lie Ala 
370 375 380 

Val Leu Val Leu Glu Asn Gin Val Gly Asn Phe Lys Gin Lys Thr Leu 
385 390 395 400 

Glu Asn Pro Asp Tyr Phe His Asn Val Tyr Trp Val Asp Leu Glu Glu 
405 410 415 

Gin Thr Glu Gly Val Leu Glu Glu Lys Pro Glu Asp Tyr lie Gly Leu 
420 425 430 



<210> 8 
<211> 70 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Single- stranded oligonucleotide 
<400> 8 

5 aaatgtcaga gttgagagcc ttcagtgccc cagggaaagc gttactagct gcagctgaag 60 
cttcgtacgc 70 



<210> 9 
10 <211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

15 <223> Description of Artificial Sequence: Single-stranded oligonucleotide 
<400> 9 

agttatttat caagataagt ttccggatct ttttctttcc taacacccca ggcataggcc 60 
actagtggat ctg 73 

20 

<210> 10 
<211> 33 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Single-stranded oligonucleotide 
30 <400> 10 

cccaagcttg gcaatgtcaa aagcatttag tgc 33 



<210> 11 
35 <211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

40 <223> Description of Artificial Sequence: Single- stranded oligonucleotide 
<400> 11 

ccgctcgaga ttttataaac ctatatagtc ttctgg 36 



45 
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